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1 ^ Intro duct OCT 

In a recent article I made attempts to test, by multiple 
correlation calculations^ some versions of theories on income 
distribution in wMch one or two of the explanatory variablea are 
the average level and the distribution of education^?^* Since I 
wrote that article new material has come to my Imowledge which made 
it tempting also to use this material for the ame purpose. Three 
studies by Americans , namely T. Paul Schultz £^S^ ^ Leland S. Burns 
and H*B. Freeh III / \J and Barry Chiswiok £^2,3 based on an 
intei^esting and large amount of information, have b^en the basis for 
the present study, which also contains some material selected and 
processed by the present author; The material consists of data on 
subdivisions of three countries^ the United Statas, Canada and The 
Netherlands, Although the authors mentioned adhere to theoriea of 
income distribution Gomewhat differGnt from my own theory ^8_7j 
thoir matorial ,can be used also to test the latter, subject to some 
assumption B, The matGrial added by my own raodest extension seems to 
fit the purpO£^e somewhat better, however, and suggests some further 
research in that direction , The present article constitutes a progress 
report only, to be followed by further work. As set out already briefly 
in the v/ork quoted, the main difference between my theory and those of 
the present American school, grouped aroimd such well-known authors as 
T.W* Schultz, A. rdncer and others, is that. I introduce demand by tht 
"organisers of production" for nkill or qualification alongside with 
supply, Domand has ba©n mentioned by T,P, Schurtz / 6, p, 13^, but 



* I want to expreos my oinoero thanks to my collaborators A. ten Kate, M,Sc, 
and H. Visocher for the prograraming of many caloulafcions used in this 
article* 



not included in his explanatQry variables. One of the points of 
focus of this essay thsrefort consists of attempts to five practical 
shape to the intrDduation of variables supposad to reprassnt damand. 
But I also want to add an amXuexon in some previoua presentation.s of 
this deaand-supply theory, Togethe ^ Kith a few more refinements the 
theoretical bass chosen will b© set out in Section 2* Some oharaote- 
ristics of the testing material used will be discussed in Section 3, 
In the remaining sections some results obtai.ned for the three countries 
mentioned will be shown and compared with results obtained by others* 

2. The Demand-^Supply Theory of Incom e Distribution 

The simplest theories of price fomation for single commodities 
can be summarised by saying that they assume the existence of a demand 
equation and a supply equation, both containing the quantities traded 
and the price as variables. Xn the demand equation one or more other 
variables are added oharacterizinf the position of those who demand | 
in the supply equation one or more variables are added onaraoteriEing the 
position of suppliers* These additional variables have been indicated as 
demand factors and supply factors, respectively. By the elimination of 
quantities traded we can, retain a "price formation equation*' which 
explains the prico in terms of the demand and the supply factors. In a 
way the difference between the values of the demand factors and those 
of the supply factors^ when brought on some common denominator^ can be 
called the tension between demand and supply quantitiee just mentioned. 
This is why I sometimes indicated the demand-supply theory as the "tension 
theory"* We can consider as a dummy variable for demand factors the number 
people of a certain skill needed by the organiEers of production and as one 
the diimmy variables for supply factors the number of people which by their 
education and £ Its contents could be briefly surMariied ay the propo- 
sition that high incomes will be paid to qualifioations for which there 
is a high tension and low income to qualifications for which there is a 
low, even a "negative" tension, namely where supply surpasses demand* 
The incomo diotribution may then be derived from the distribution of 
qualifications required and qualifications available. Incomes could 

£ other factors possess this skill. 
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become almost equal if there is no tension between the two distri- 
butions* People would not need to be of equal productive quality in 
oMer to attain this near-equality of incomes- 
One condition to be fulfilled for any attempt to test the 
demand-supply theory is that the geographical units compared in a 
cross-eeetiDii or time series analysis be large enough to oontain 
both the demand and the supply location. For cOMuters there is a 
distinction between the plaoe where they work (and where the demand is 
exerted) and the place where they live (wehre the supply is shown)* 
This implies that cross section studies using single aunicipalitiesj 
such as the Burns-Freeh study and some of T,P, Schulta-s investigations, 
may lead to unreliable results. For that reason I haire preferred to use 
data for the (eleven) provinces of the Netherlands only 3 as also done 
by Schultz, 

Material Used in the Presenit, Study; some Lacunae 

As already observed^ this study deals with cross-section 
analyses for three countries. The figures refer to the states of the 
United States (Ghiswick)^ the provinces of Canada (same author) and a 
number of raunioipalities (Burns and itech), the socio-geographic areas 
and the provinces of the Netherlands (Sohultgp Tinbergen)* Burns and 
Freeh in particular have chosen the 71 largest municipalities, Schult^ 

aelected at random and both Schultz and I took the eleven provinces 
Df my country. The advantage of the type of material chosen consists of 
horaogeneity in cultural and other respects, partly i^nown even, which 
does not exist for cross section studies among widely differing countrlee 
as carried out by Lydall £^^J and myself £'lj^ This homogeneity is also 
lacking in time series studies, because of changes both in the system of 
education and in the tealmolo^ of production* 

There are also disadvantages connected with cross-section - 
studies within a single country | one was mentioned alreadyi commuters 
do 110 1 always work and live in the sam© geographical unit. Another is 
that vfiriations within one country, oapeoially a small country, may be 
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SO restricted as to a liindrance to extrapolations, which are the 
main instruments to arrive at the more interesting answers we want to 
deriira from our studies* 

Pinally the material used in this article euffers from some 
lacunae because time did not permit me to calculate the demaid 
variable (which for the Netherlands gave the best results) for the two 
larger countries^ It is my hope that this lacuna can be filled up on 
a later occasion. Similarly^ the yardaticks used for income inequality 
have been different and also this lacima may be filled later. The 
variables used in this article have been listed and defined in Table I* 

Table I* List of variables used by authors quoted (tJSA = United States 
of America; CM ^ Canada, NL = Netherlajrids) * 
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Chiswick (UAS + GDN) 
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Note I capital letters are used for variables in units indicated 1 
case letters mil be used for "normalised" variables (i,e* 
0, atajidard deviation I). * meansi variable not used. 

1) For i960 Percantagi of activ© malea with higher education, 

2) Total Population, 
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4« Using ChiswickVs Mate rial for th e United Statea 

For each of the data collections analysed we used two ways 
of measuring the variables usedi the "natural units" as indicated 
in Table I and nomalised units (with zero average and unit standard de- 
viation) I the latter being indicated by lower case letters* We at*- 
tempted to study the structure of relationship by comparing repression 
coeffioients found in different combinations for the same variable. 
Ghisv/ick's material on the United States was used to construct Table II, 



Table II* Regression and multiple correlation coefficients (r) found 
for different combinations of variables explaining income 
inequality 



No. 


Regression 


coef ficienta 


for explanat 


ory variable 


R 






.1 


-0.79 


• 


u 
« 


V 


0.79'^ 






2 




-0.73 




• 


0.73 






5 






+0.48 




0.48 


1 ; 


^xplp var* 


4 




ft 


* 


-0.86 


0.86 






5 


-0.60 


-0.25 


• 




0.80' 






6 


-0.71 


• 


+0.25 




0.825 




expl* var. 


7 


+0.08 


« 


• 


-0.94 


0.86 _ 






8 


-0.02 


+0,15 


+0.31 




0.83"j 


) 




9 


+1 .25 


-0*67 


• 


-1 .58 


0.93 


3 


expl, var. 


10 


+0,65 




+0,42 


-1 .38 


0.94 J 






11 


+1 .02 


-0.33 


+0.315 


-1.52 


0,94 


4 


expl* var. 



Sources I 3J9 Table 3-3* 

We did^ not use all the variables shown in Chiswick's study, 
for inatanco not his variable r^^ the rate of return on education derived 
for each state from the regression, in that statt^ of income over 
schooling. My feeling was that its use would duplicate the variables 
Z and U, since GhiBwick's (and MInQor's) theory is that the choice of 



everybody's length of schooling is partly based on It seems 
that indeed r is superfluous ^ even statistically; there appears 
to be comp].ate multicollinearity in the set {x^ y^g, u,Vs 
The following conclusions seem warranted i 

The influence exerted by variables u (education inequality) 
and V (representing other influences on income^ euoh as innate 
capabilities) are stable | variable v always raises considerably the 
correlation coefficient. The contribution of is less important, 
but stable. The influence of taken here to represent the demand 
for qualified manpower, looks uncertain, since positive as well as 
negative regression coefficients are found* Negative coefficients 
occur when and only when v is excluded. The cases with the highest 
multiple correlation coefficients show a positive regression coefficient 
for X* influence exerted by variable is negative in most cases t 

These statements induce me to select case no* 11 as the most satisfactory 
relationship found with the aid of Ohiswick'a material* 

Using natural units we must divide the corresponding symbols 

by their standard deviations^ ^ven below (sourcei Table G-5)i 

0 = 0.12; 0 ^ 0.231 o ^ 0*79l 0 ^ 3.17l o - 0*29| the relation 
n y % 'u 'v 

then becomesi 

X ^ 0.53 Y - 0.05 Z + 0*012 U - 0.63 V (4.I) 

As an illustration of the influence which a higher level and 
a more equal distribution of education may exert, we assume an increase 
in schooling years of 2 and a reduction of its variance by 41 such 
changes would load to A-X - 0*10 - 0.05 ^ - 0.15* Since the average 
value of X| that is X s 0*79* this represents a very modest reduction 
of inequality in income in the United States; it reduces the standard 
deviation of incomes from \^0.?9 to ^0*64 or from 0*89 to 0.80 or by 
10 per cent only* As we shall see for the ease of the Netherlands, the 
coefficients for and U may become larger, however, if Y is replaced 
by a better measui'e for demand. 
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5* Using Chlswick' s Material for Canada 

Chiswick has collected for Ganad ■ the same mterial aa for 
the United States, Some of the results obtained with its aid are 
given in Table III* 

Table III. Regression and multiple correlation coaffieients R 
obtained from different combinations of variables 



explaining x. 


No. 


Regression 


coefficients for 


explanatory variable _R 








J. 


u 








1 


-0.62 




• 


• 


0.62' 




2 


• 


-0.54 




■ 


0.54/ 




3 


• 


* 


-0.15 




0.15 ( 


one expl* 


4 


* 


• 




-0.67 


0.67^ 


variable 


5 


-0.55 


-0.09 






0.625 




6 


-0.85 


• 


+0.38 




0.68 


i 2 expl. 


7 


+0.08 




• 


'0.74 


0.67 


f variables 


8 


-1.93 


+0.90 


+0.91 




0.76 ' 




9 


+1.17 


-0.49 




-1 .48 


0,72 


^3 expl. 


10 


+0.59 


• 


+0.82 


-1 .83 


0.85 . 


variables 


11 


+0.10 


+0.27 


+0.92 


-1 .61 


0.86 


4 expl, var* 



Source I / jj^, Table 3-^12. 

From the table we see that the influence exerted by ^ and 
^ is unstable, whereas the one exerted by u and v is relatively stable* 
Also, Inclusion of ^ or v considerably raises the correlation coefficient. 
TransfoiTQing equation (ll) into one with the units used by Chiswick 
and mentioned in Table I, we obtain i 

ofe - 0^ * °-27 ^ + 0.92 Oi = 1 .61 0^ 

or X = 0,043 y + 0.031 2 + 0.077 U - 0.56 V (5.1 ) 



In contrast with the rt .It for the United States, there 
is a positive influence of the average IsTel Z of education on 
inaome inequality X; this implies that the average level would 
already be too high. A possible explanation may be in the fact 
that in Canada education is obligatory too a larger extent than 
in the United States; at least for Great Britain this ar^iment is 
used by Chiswick £ zj and in this respect C^ada probably is some^ 
what closer to Britain than the United States, 

Considering that 0 ^ 10*69, we may think of a reduction 
in the inequality of schooling as a means to reduce income inequality 
and estimate the influence of ^U = - 5^ meaning that the standard 
deviation in years of schooling reduces from Vl 0*69 to VS. 69 or from 
3-27 years to 2.36 years. We obtain: 

= - 0*385 (5.2) 

Since X a 0,63, this b -ings inequality as measured by X to 
less than one half ^ but vrhen measured as a standard deviation in the . 
natural logarithms of income from VoTSF to ^^0*445 or from 0*795 to 
0*666 ^ a reduotion by 16 per cent only* 

A common feature to the equations found for both the United 
States and Canada is that raising the Y^, standing for other factors 
than schooling which determine an individual's productivity ^ reduces 
inequality in about the same way. This may in part reflect the 
influence of the "environment'^, among it the influence of the 
education of the parents* If this interpretation is correct, the 
lon^-run influence of education may be considerably stronger than 
the direct influence' estimated* 

6. Research on the Netherlands by T*P* Schultg and by L,5. Burns 
and H.tl> Freeh III 

Schulta^s contributions /"Sf p. 352_7 to the explanation of 
income inequality consist of having collected a vast collection of 
statistical data, for 11 provinces, for 75 regions and for 88 
municipalitlea selected in a random sampls (p, 339/340) and of having 
analysed various relations in order to explain ohanges over time with 



the aid of various explanatory variables as well as of having 
studied cross section data. For this article the latter are the 
more relevant analyses. Income inequality among regions as well 
as among provinces, meaeured by their Qoncentration ratio have 
been explained by a variety of variables, including the level of 
education, for which Schults found a positive influence. No use 
is made of demand factors^ which prevent us from testing the 
demand-supply theory. The other explanatory variables include 
number of tax payers, unemployment and wealth. The best results 
are obtained for the most recent year studied by him, 1958, and for 
the provinces. This seems to confim the viewpoint that the geo- 
graphical units should not be chosen too small. With the aid of 
the education level (measured bb the percentage of active population 
having had higher education) a corrected correlation coefficient of 
0*89 is obtained. This result comes close to my own results, to be 
discussed in Section 7, 

Bums and Freeh used the figures for 71 of the larger munici- 
palities. Their material enabled me to compute Table IV, where the 
symbols are those exflained in Table I. 

Table IV. Regression and (multiple) correlation coefficients R found 
for different combinations of variables explaining income 
inequality x* . 



No, Regression coefficients for • R 











1 


-0.91 




c 


2 


« 


-0.50 


* 


3 


• 


• 


-0.68 


4 


-0.93 


+0.02 




5 


-1 .05 


• 


+0,175 


€ 


-1 .04 


-0.02 


+0.177 



0.91 ] 
0,50 (, 
0.68 i 

0.91 ^ 

0.91 J 



1 expl, variable 



axpl, variables 



0.91 3 expl. variables 

Source; ^"^J, Table lb, and figures on ^' kinclly supplied by the 
authoro. ♦ 
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These reaults may be interpreted so as to attach the main 
role in the explajiation to incomes, with a clearly negatiYe 
influence. The influence of the two education variables is secondary, 
with that of the level of education uncertain even as to its 
algebraic sign^ whereaB inequality of education shows a positive 
influence. If income ^* can be considered as a demajid indicator 
for high qualification J its influence should be positive and so 
interpreted the demand-supply theory is rejected* But I have some 
doubts, already announced * whether not the geo graphical units taken 
are too small* A sroup of typically commuter municipalities, whose 
commuters work in the nearby large cities tosterdam, Rotterdam and 
The Hague, do not reflect the demand for the comnuters' qualifications. 
They happen to have high incomes and at the same time low inequality 
of incomes. Later (Section 7) we will find that for the larger units i 
the provinces, a completely different situation prevails* 

7* Fiarther Research on the Netherlands 

In an attempt to test the demand-supply theory I tried to 
construct a slightly more precise indicator for demand. From the 
American 1960 Census of Population quoted in ^3 J the percentage 
of the manpower with higher education was found for \ ; four main 
sectors* agriculture j manufacturings trade and transportation and 
services (defined as the remainder). For each of the Butch provinces 
the total number of persons active in the four main sectors are 
knovm from the Butch''f960 Cernus of Population. Multiplying the 
percentage with higher educn ion needed, taken from American figures a 
(probably overastimated) ind *x of demand was derived. On the supply 
side, two indicators were used, in order to open up the possibility 
of different weights to be given to manpower with seQundary and 
manpower with third-level higher education. At the sMe time it was 
assumed that private cost of tliird level education is related with 
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income foregone, to be represented by a constant, reflecting income 
of people with secondary education only. 

The demand-supply theory was given a shape better adapted 
to the data available. As the Yariable representing income inequa-- 
lity we considered the upper decile income divided by average 
income (in Lydall^s E^J notation P^q)* Demand for and supply of 
people with higher education were represented by d^ ^ d^ and 
s^. + respeotively, where the indices 1 and 2 represent two sub- 
groups i group 2 /:>eing university ^aduates and group 1 representing 
all other people v/ith higher education. As set out in Section 2, 
the differences between demand and supply were taken as two expla- 
^ nato^ variables, but the possibility was kept open that the weights 

of the two differences d^ - and d ^ " % ^® diff erenti a 

scarcity in category 2 may be more important to explain inequality 
than the same scarcity in category 1 • Taking into account that in 
the absence of inequality X* • must be 1 and that our method of 
calculating quantities demanded is based on Ameilcan figures, a 
formula of the following shape was tested i 

X'« ^5 ^ (d^ - s^) + C 2 (dg " Sg) + 1 + c (7.1) 

where £ indicates the correction for the use of American figures. The 
data available do not permit us to introduce d^ and dg separately, 
however* For this reason we combine 5^d^ + 5 gig 51*' specify 
^ the correction tern c^ to be 5 (y* ' " ^o'^ where the suffix o refers to 

the United States* Replacing and s^ by Z« • « ^ U« • and U» ' (of, 
.. Table l) respectively, we finally obtain, for the purpose of testing 
the demand-supply theoryi 

Xt» = 5Y*« ^ %^ (Z''' ^ U»») - SgU'' + 1 + S(Y" ^ Y^O 

Our best result obtained runsi 

^ 1,21 Y** ^ 0.08 Z''» - 1.16 U«» ^ 11.4 (R =^ 0.96) 

■ (7.2) 

This is equivalent to putting S« 1.21 J 5^ = 0,08 and %^ = U08. 

d 12 
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This would leave us with an eetimata of Y" - Y' « = - IO.5, The 

direot eitimate of the ptrGentaga of active population with higher 

education in both oomitrieg yields 

Y'* = 10.4 I Y«» = 19.1 
0 

implying a value for ?• * ^o* ^ ^ 8*7* In order to teat the 
etability of the regression coafficiants found, we aonstruoted 
Table oomparable with Tables II, 111 and IV, using nomaliged 
variables* 



Table V* Regression and multiple oorralation Qoeffioients R, found 
for different combinations of variables explaining incoffle 
inequality * ■ ' 

No. Regression ooefficients for axplanatory variable R 



1 


0.84 


* 




2 




0.81 




5 






0.70 


4 


1.03 


-0.20 


« 


5 


2.50 




-1.72 


6 


a. 95 


-0.42 


-1.75 




one axplana-* ■ 
tory variable 



2 explanV 
variables 



0*96 3 expl. var* 



The negative influence of the supply variables and the positive 
influence of the demand variable is confirmed by cases 4 5* 

In order to oompara these results with those for the two . 
other countries M.d those obtained by Buras md Freeh for the 
Netherlands (based on municipalities) we constructed similar tables 
for a few alternative variables! using instead of 2' ' (closer to 
Chiswick's material) in Table VI and instead of 2* * (Bums and 
Freeh) in Table YII. ■ ' 
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Table VI* Regression and multiple oorrtlation coefficients R found 
for different combinatione of explanatory variables to 
explain x' ' . 

No* Re^eeeion coefficients fori R 





1. 




u' ' 




1 


0.88 


• 


• 


0.88 


2 




0.81 


• 


0.81 


3 


■ 


• 


0.70 


0.70 


4 


0.92 


-0.04 


• 


0.88 


5 


1.02 


• 


-0.17 


0.88 


6 


0.89 


+0.27 


-0.31 


0.89 



Table VII* Regraesion mni multiple correlation ooefficiente R found 
for different oombinations of variables to explain x* 



No. Regrasaion coefficients fori R 









u' • 




1 


0.92 






0.92 


2 


* 


0.89 




0.89 


3 


• 




0.90 


0.90 


4 


0.87 


-0.055 




0.92 


5 • 


0.91 




+0.092 


0.92 


6 


0.89 


-0.054 


+0.083 


0.92 



The reeulte presented in the last two tables are less 
satisfactory than those of Table V* the multip^ j correlation 
coeffioienta are lower and the supply influences are small and 
uncertain p 

6* Some Preliinlnag^ConcluslQnB 

The only case where, in the present essay, a considerable 
influence of the level and the inequality of education on income 
distribution ie^¥ound, ie equation (7*2). In order to reduce income 



inequality, as measured by the highest dseile divided by average 

income y to half of its 1960 value, that is, in order to attain 

_2__ 

AX ^ 2, we need U' ' m = 1.86, meaning that the peroentaga 

1 • Uo " 

of the population with university education should somewhat more 
than doubled in eomparison to the 1960 situation, when it was 1.4 
per cent* Sueh favourable results were found in several other oasee 
reported on before 9 ^ i but most of the present results are 

much leas favourable in that sense, I^om the various versions of the 
relationship found for the Netherlajads one may wonder whether not 
perhaps the use of the demajid indicator as defined in Section 7 
might ohange the Merican and Canadian figures so as to show a 
stronger influence of eduQation level or distribution on income in- 
equality. Further work will be undertaken*. 

Another conclusion seems to be that municipalities are too 
small units to compare, because of the different "location" of 
demand and supply in our sense* 

In a last attempt to compare our cross seotion analyses we 
collect our "best" aases from the various tables in the order of 
goodness of fit (Tabl© VIIl). 

Table VIII, Regression coefficients and R found in six cases^^. 



Case 


R 


Regr.coeff .for x 


M. M, 


_v 


Country 


A 


0.96 


2.95 


-0.42 -1.75 


m 


Netherlands 


B 


0.94 


1.02 


-0.33 +0.315 


-1.52 


United States 


C 


0.92 


0.89 


-0.054 +0.083 




Netherlands (provinces) 


D 


0.91 


-1 .04 


-0.02 +0.177 




Netherlands (munic,) 


B 


0.89 


0.89 


+0.27 -0.31 




Netherlands (provinces) 


P 


0.86 


0.10 


+0.27 +0.92 


-1 ,61 


Canada (provinces) 



1 ) Primes used to distinguieh variables in Table I have been omitted 
in this table. 



* It is also conceivable that a longer-term influence on income distribu- 
tion may be implicit in the influence of variable as already observed 
in Section Sp a suggestion made to me by J, P. Pronk^ M.A, and sub- 
stantiated for Norwegian samples by Solto*wi, L, foward Income Equality . 

: :: in,Norw^, Madison 1965/ . -i ^^; ;; " r'::t^^^^^ 



- 15 - 



There are some regularitiee in this table worth mentioning* 
With the exoeption of case which we rejected beoause of tht 
use of too amall geographieal unites the cD#ffioients for ^ 
(or substitutes) fall and so do (even including case D) the negativ© 
coefficients for (or subetitutee) , ^ere availabltj the influence 
of V| reprtienting other faotors making for quality, exert 
considerable influence. This is an. argument in favour of introducing 
such additional variables, as done by Chlswick in an inventive way. 
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